Designing human benchmark experiments for testing software agents | IET Conference Publication | IEEE Xplore